18.785: Analytic Number Theory, MIT, spring 2007 (K.S. Kedlaya) 

Prime /c-tuples 

This unit begins the third part of the course, in which we apply the results gathered 
in the first two parts in order to say something about the extent to which primes cluster 
together in short intervals. 

Reference cited below: P.X. Gallagher, On the distribution of primes in short interval, 
Mathematika 23 (1976), 4-9; corrigendum, ibid. 28 (1981), 86. (I couldn't find this online.) 

1 The Hardy-Littlewood /c-tuples conjecture 

Let H denote a fc-tuple of distinct integers. What does one expect about the distribution of 
the integers n such that n + h is prime for each h e 7i? 

Here is a rather simple-minded guess. The prime number theorem suggests that if one 
chooses a random integer of size x, it will be prime with probability l/(loga;). If one then 
chooses k distinct integers of size x, and there is no obvious reason why they cannot all be 
prime, then one might expect them to be simultaneously prime with probability log"'^ x, and 
the number of such tuples with terms bounded by x should be asymptotic to x log~'^ x, with 
the constant 1. 

However, this turns out not to be the correct constant, as is easily verified against experi- 
mental evidence in the case of twin primes. The reason is perhaps obvious: the facts that the 
different n + h are coprime to a fixed prime p are not independent, and one needs to account 
for this. Here is the recipe for doing so proposed by Hardy-Littlewood (and mentioned by 
Ben Green in his guest lecture). 

Fix a prime p. The probability that k randomly chosen integers are all not divisible by 
p is (1 — 1/p)''. On the other hand, the probability that the n + h are all coprime to p is 
1 ~ V'nip)/p, where v-nip) is the number of residue classes modulo p represented by elements 
of H. We thus set 

e(H,^n(i-^)(i-i)-^ 

(called the "singular series" , because it occurs in the Hardy-Littlewood circle method as a 
series summed over singularities of some integral) and conjecture as follows. 

Conjecture 1 (Hardy-Littlewood). Suppose that vnip) < P for all p. Then the number of 
integers n < x such that n + h is prime for each h E Ti. is asymptotic to &{H)x log~*^ x. 

Of course if Vfi{p) = for some p, then there is a trivial obstruction created by divisibility 
mod p, so you only get finitely many prime /c-tuples of that shape. On the other hand, if 
unip) < P for all p, then the product converges absolutely and so > 0. 

Gonvention: it will be convenient later to take the same definition for &(J~C) even if H. 
does not have distinct entries. 
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2 A:-tuples and prime gaps 



If one is only interested in looking for primes which are close together, without specifying 
exactly what the gaps are, one could go back to the probabilistic model (attributed to 
Cramer, more famous for his rule for solving linear systems). It suggests that the distribution 
of 7r(n + h) — 7r(n), ior n < N and h ~ AlogA^ with A fixed, should approach a Poisson 
distribution with parameter A as cxo. The fact that this follows from a suitably uniform 
version of the fc-tuples conjecture is due to Gallagher; the main part of the argument is the 
following result, which we will need later. 

Theorem 2 (Gallagher). We have 



In other words, the fudge factor 6(7Y) between the probabilistic model and the Hardy- 
Littlewood prediction averages out to 1, so the prediction based on the probabilistic model is 
consistent with Hardy-Littlewood. (Note: the contribution from tuples not having distinct 
entries is 0(a;^^^), so it doesn't matter whether we include them or not.) 

Here is a sketch of Gallagher's proof, with the missing details left as exercises. (Through- 
out, keep k fixed.) Put 



'H€{l,...,x}'' 




an{p) = a{p,vn{p)) 



so that 



e{n)^ll{i + an{p)). 



p 



Extend a by multiplicativity to squarefree arguments d, so that 



e(7^) = E««(^) 



d 



with the sum on the right being absolutely convergent. 



We can truncate the sum over d by showing that for each fixed e > 0, 




(1) 



ne{i,...,x}'' 



d<y H 



with the constant depending only on /c, e and not on x, y (exercise). 
For any given d, we can rewrite the inner sum of (1) as a sum 
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where v runs over vectors indexed by the prime factors of d, with e {1, . . . ,p} for each 
and f(i{x,v) counts /c-tuples e {1, . . . ,xY which occupy exactly v{p) residue classes 
modulo p for each p\d. 

Write I ^ } for the number of partitions of an a-element set into b unordered parts (Stirling 
number of the second kind) . If we set 

.(.)^En'.&'-W)U))KP):{4} 

^w=Eni<'ta"W)i(X))*"{4} 

V p\d 

then 

J2 an{d) = {x/d)''A{d) + 0{{x/df-^B{d)) + 0{x^-^C{d)). (2) 

■H 

From this, plus the identities 




it is not difficult to deduce Theorem 2. 



Exercises 

1. Prove that for k a positive integer, 

log"'' tdt X \og~^ X. 



i: 
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2. Prove (1). (If you get stuck, see the hint for problem 5.) 

3. Prove (2). (Hint: it might help to think in terms of counting lattice points.) 

4. Prove the identities (3), (4). 

5. Complete the proof of Theorem 2 from (1) and (2). (Hint: first use the Stirling number 
identities to calculate A{d). Then estimate B{d) and C{d), using the bound 



|a(p, m)\ < 



c{k)ip — 1) ^ m — k 
c{k){p — 1)"^ m < k. 



That is, the constant c{k) depends on k but not on p or m. Finally, take y — x^^^ in 
(I)-) 
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